# Efficient inference optimization
## Helium-1-2b-Q8_0-GGUF
NikolayKozloff · Downloads: 53 · Likes: 3
A GGUF-format model converted from kyutai/helium-1-2b, supporting multiple European languages.
Tags: Large Language Model · Supports Multiple Languages
## Qwen3-0.6B-Base
unsloth · Downloads: 10.84k · Likes: 2
License: Apache-2.0
Qwen3-0.6B-Base belongs to the latest generation of the Qwen (Tongyi Qianwen) series of large language models, which offers both dense and Mixture-of-Experts (MoE) variants.
Tags: Large Language Model · Transformers
## BitNet-b1.58-2B-4T-GGUF
tdh111 · Downloads: 1,058 · Likes: 4
License: MIT
A 1.58-bit quantized large language model developed by Microsoft, designed for efficient inference and offered in IQ2_BN and IQ2_BN_R4 quantization variants.
Tags: Large Language Model
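To make the "1.58-bit" idea above concrete: each weight is constrained to the ternary set {-1, 0, +1} (log2(3) ≈ 1.58 bits) plus a shared scale. The sketch below is a minimal pure-Python illustration in the spirit of BitNet b1.58's absmean quantization; the function names are illustrative, not Microsoft's actual API, and real kernels operate on packed tensors.

```python
# Hedged sketch: ternary (1.58-bit) weight quantization with an absmean
# scale, as described for BitNet b1.58. Illustrative only.

def absmean_quantize(weights):
    """Quantize a list of floats to {-1, 0, +1} with a per-tensor scale."""
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    # Scale each weight, round to the nearest integer, clamp to [-1, 1].
    ternary = [max(-1, min(1, round(w / scale))) for w in weights]
    return ternary, scale

def dequantize(ternary, scale):
    """Recover approximate float weights from the ternary codes."""
    return [t * scale for t in ternary]

q, s = absmean_quantize([0.8, -0.05, -1.2, 0.3])
print(q)  # → [1, 0, -1, 1]
```

Because every weight is -1, 0, or +1, the matrix multiply reduces to additions and subtractions, which is the source of the efficiency claim.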
## GLM-Z1-9B-0414-Q4_K_M-GGUF
Aldaris · Downloads: 205 · Likes: 2
License: MIT
A GGUF-format conversion of THUDM/GLM-Z1-9B-0414, supporting Chinese and English text generation.
Tags: Large Language Model · Supports Multiple Languages
## Hunyuan-7B-Instruct-0124
tencent · Downloads: 590 · Likes: 50
License: Other
Hunyuan-7B is an open-source large language model released by Tencent. It handles contexts up to 256K tokens, uses Grouped Query Attention (GQA), and performs strongly among Chinese 7B dense models.
Tags: Large Language Model · Transformers · English
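The GQA mechanism mentioned above shrinks the KV cache by letting several query heads share one key/value head. Below is a hedged, pure-Python sketch of that head routing under assumed toy shapes (per-head vectors as plain lists); production implementations batch this with tensors, and `gqa_attention` is a hypothetical name, not Hunyuan's API.

```python
# Hedged sketch: Grouped Query Attention head routing. Each group of
# n_q_heads // n_kv_heads query heads attends to one shared KV head.
import math

def gqa_attention(queries, keys, values, n_kv_heads):
    """queries: one vector per query head; keys/values: per-KV-head lists
    of (seq_len) vectors. Returns one output vector per query head."""
    n_q_heads = len(queries)
    group = n_q_heads // n_kv_heads      # query heads per KV head
    out = []
    for h, q in enumerate(queries):
        kv = h // group                  # shared KV head for this query head
        # Scaled dot-product scores against the shared keys.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(len(q))
                  for k in keys[kv]]
        # Numerically stable softmax over the sequence.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the shared values.
        out.append([sum(w * v[d] for w, v in zip(weights, values[kv]))
                    for d in range(len(values[kv][0]))])
    return out
```

With, say, 4 query heads and 2 KV heads, heads 0-1 read KV head 0 and heads 2-3 read KV head 1, halving the cached K/V state at 256K-token context lengths.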
## DeepSeek-R1-Distill-Llama-70B-GGUF
unsloth · Downloads: 11.51k · Likes: 79
DeepSeek-R1-Distill-Llama-70B is a 70B-parameter large language model built by the DeepSeek team on the Llama architecture, optimized through distillation and supporting efficient inference and fine-tuning.
Tags: Large Language Model · English
## Deepthink-Reasoning-7B-GGUF
bartowski · Downloads: 1,180 · Likes: 3
License: OpenRAIL
A llama.cpp imatrix quantization of Deepthink-Reasoning-7B, offered in multiple quantization types to suit different hardware.
Tags: Large Language Model · English
## Gemma-2b-it-Q4_K_M-GGUF
codegood · Downloads: 434 · Likes: 1
A GGUF-quantized version of the Gemma-2b-it model, suitable for local inference and supporting text generation.
Tags: Large Language Model · Transformers
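Several of the models above ship in Q4_K_M, one of GGUF's block-wise 4-bit formats. The sketch below is a simplified, hedged illustration of the general idea (fixed-size blocks, each stored as a per-block scale plus small integers); the real Q4_K_M layout additionally stores per-block minima and super-block scales, and these function names are illustrative, not llama.cpp's API.

```python
# Hedged sketch: block-wise 4-bit-style quantization. Each block of
# weights is reduced to one float scale plus small signed integers.

BLOCK = 32  # weights per block, matching llama.cpp's Q4 block size

def quantize_block(block):
    """Quantize one block of floats to integers in [-7, 7] plus a scale."""
    scale = max(abs(w) for w in block) / 7 or 1.0
    q = [max(-7, min(7, round(w / scale))) for w in block]
    return scale, q

def dequantize_block(scale, q):
    """Recover approximate float weights for one block."""
    return [scale * v for v in q]
```

The round-trip error per weight is bounded by half the block scale, which is why per-block (rather than per-tensor) scaling keeps 4-bit models usable for local inference.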
## Jamba-v0.1
ai21labs · Downloads: 6,247 · Likes: 1,181
License: Apache-2.0
Jamba is a state-of-the-art hybrid SSM-Transformer large language model that combines the Mamba architecture with Transformer layers, supports a 256K context length, and surpasses similarly sized models in throughput and performance.
Tags: Large Language Model · Transformers
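The SSM half of a hybrid like Jamba boils down to a linear recurrence whose state is constant-size per token, which is what makes very long contexts affordable compared with quadratic attention. Below is a hedged scalar toy version of that recurrence; real Mamba-style layers use learned, input-dependent parameters per channel, and `ssm_scan` is a hypothetical name.

```python
# Hedged sketch: the linear state-space recurrence at the heart of an
# SSM layer. O(1) state per step, O(n) work over the sequence.

def ssm_scan(xs, a=0.9, b=1.0, c=1.0):
    """h_t = a*h_{t-1} + b*x_t ; y_t = c*h_t, scanned over a sequence."""
    h, ys = 0.0, []
    for x in xs:
        h = a * h + b * x   # update the hidden state
        ys.append(c * h)    # emit the output for this step
    return ys

print(ssm_scan([1.0, 0.0, 0.0], a=0.5))  # → [1.0, 0.5, 0.25]
```

An impulse input decays geometrically through the state, and no past tokens need to be kept around, unlike a Transformer's growing KV cache.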
## Whisper-Telugu-Medium
vasista22 · Downloads: 228 · Likes: 2
License: Apache-2.0
A Telugu speech-recognition model fine-tuned from OpenAI's Whisper-medium, trained on several public Telugu ASR datasets.
Tags: Speech Recognition · Other